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Mh, The quotient correlation is defined here as an alternative to Pear- 

■^^ ' son's correlation that is more intuitive and flexible in cases where the 

tail behavior of data is important. It measures nonlinear dependence 
where the regular correlation coefficient is generally not applicable. 
One of its most useful features is a test statistic that has high power 
^H , when testing nonlinear dependence in cases where the Fisher's Z- 

X,/^ • transformation test may fail to reach a right conclusion. Unlike most 

asymptotic test statistics, which are either normal or x^, this test 
statistic has a limiting gamma distribution (henceforth, the gamma 
test statistic). More than the common usages of correlation, the quo- 
tient correlation can easily and intuitively be adjusted to values at 
tails. This adjustment generates two new concepts — the tail quotient 
correlation and the tail independence test statistics, which are also 
gamma statistics. Due to the fact that there is no analogue of the 
correlation coefficient in extreme value theory, and there does not 
f^ ' exist an efficient tail independence test statistic, these two new con- 

f^ . cepts may open up a new field of study. In addition, an alternative 

to Spearman's rank correlation, a rank based quotient correlation, 
is also defined. The advantages of using these new concepts are il- 
^\ • lustrated with simulated data and a real data analysis of internet 

QQ ' traffic. 



1. Introduction. In measuring the linear relation, Pearson's correlation, 

based on the product of z-scores, is extremely widely used. Applications 

^ ' related to correlation coefficients are thriving in many areas. However, there 

does not coexist a term with Pearson's correlation based on the quotient 

as far as we know, and furthermore, there is no analogue of the correlation 



Received December 2006; revised April 2007. 

^Supported in part by NSF Grants DMS-05-05528 and DMS-06-30210, Wisconsin 
Alumni Foundation, and Institute for Mathematical Research, ETH Zurich. 

AMS 2000 subject classifications. 62H20, 62H15, 62G10, 62G20, 62G32, 60G70. 

Key words and phrases. Nonlinear dependence, tail dependence, necessary condition of 
tail independence, testing (tail) independence. 



This is an electronic reprint of the original article published by the 
Institute of Mathematical Statistics in The Annals of Statistics^ 
2008, Vol. 36, No. 2, 1007-1030. This reprint differs from the original in 
pagination and typographic detail. 

i 



2 Z. ZHANG 

coefficient in extreme value theory [de Haan and de Ronde (1998)]. In this 
paper the quotient is chosen for its intuitive appeal. 

This paper introduces a class of quotient correlations which can be used 
as an alternative to Pearson's correlation, and a class of rank based quotient 
correlations which can be used as an alternative to Spearman's rank correla- 
tion. Like both Pearson's correlation coefficients and Spearman's rank cor- 
relation coefficients, the quotient correlation coefficients and the rank based 
quotient correlation coefficients measure the level of agreement between two 
random variables. 

In testing linear (nonlinear) independencies, although much work has been 
done, there is no public consensus on which method is the best. In the liter- 
ature, Fisher (1921) introduces the extremely useful Z-transformation func- 
tion, and Hotelling (1953) states that the best present-day usage in dealing 
with correlation coefficients is based on Fisher (1915, 1921). Recently, Free- 
man and Modarres (2005) showed that after Box-Cox transformation to 
approximate normal scales, Fisher's Z-transformation test is more efficient 
when testing independence between two positive random variables. Besides 
Fisher's Z-transformation test, many other testing procedures have been 
developed. These procedures are summarized in books by Cromwell, Labys 
and Terraza (1994), Cromwell, Hannan, Labys and Terraza (1994), Kanji 
(1999) and Thode (2002), among others. 

We deliver a new test statistic, which we call the gamma test for indepen- 
dence, based on the quotient correlation concept. Considering that almost 
all existing asymptotic tests are normal or ■y^ tests, our test is innovative 
because it has an asymptotic gamma distribution. More importantly, we 
show a theorem which assures that the limiting distribution of the test is 
still a gamma distribution even if one uses parametric estimated distribution 
functions or nonparametric empirical distribution functions to transform the 
data into unit Frechet scales which are assumed in the definition of the test 
statistic, that is, parameter estimations and marginal transformations won't 
affect the limiting distribution of the test statistic. We shall show its effi- 
ciency and the inefficiency of Fisher's Z-transformation test when nonlinear 
dependence occurs. 

There is a growing body of literature on modeling and testing tail de- 
pendence, also known as extremal dependence or asymptotic dependence, 
between the components of a two dimensional random vector, which refers 
to the concurrence of extreme values in each component. Examples dealing 
with asymptotically independent random variables can be found in Wat- 
son (1954), Sibuya (1960), de Haan and Resnick (1977), Chow and Teugels 
(1978), Anderson and Turkman (1991), Joe (1997) and Grady (2000). A 
significant step is due to Ledford and Tawn (1996, 1997). They introduced 
a sub-model which can be used to model tail dependence and nearly tail 



QUOTIENT CORRELATION AND GAMMA TEST 3 

independence. Several papers, such as Peng (1999), Draisma, Drees, Fer- 
reira and de Haan (2004), Ferreira and de Haan (2004) and others, have 
focused on estimation and rare events niodehng, particularly of Dutch coast 
wind and water level extremes. However, it is not clear whether the data are 
tail dependent or not due to different conclusions. In the meantime, Hsing, 
Kluppelberg and Kuhn (2003) develop visualization methods to check tail 
dependence, and Falk and Michel (2004) study a different sub-model and 
graphic checking of tail dependence. These models are certainly useful and 
important in extreme value theory. However, constructing a test statistic 
for tail independence is still an interesting open problem [Campos, Marron, 
Resnick, Park and Jeffay (2003)]. Longin and Solnik (2001) and Campos et 
al. (2003) point out that in the study of extremal dependence, misleading 
results, even conflicting results, are often reported in the literature. 

The quotient correlations can easily and intuitively be adjusted to values 
at tails. This adjustment enables us to construct tail dependence measures 
and to construct efficient test statistics for tail independence. This adjust- 
ment generates two new concepts — the tail quotient correlation and tail 
independence test statistics, which are also gamma statistics and effectively 
detect tail independencies or tail dependencies in all simulation examples to 
be presented here. 

The paper is organized as follows: in Section 2 we introduce the quotient 
correlation coefficient and the rank based quotient correlation coefficient. 
We then in Section 3 develop the gamma test statistic for testing indepen- 
dence. We compare the gamma test with the Fisher's Z-transformation test. 
Section 4 introduces the tail quotient correlation concept. In Section 5 we 
obtain our gamma test statistics for testing tail independence. A necessary 
condition for two random variables being tail independent is presented here. 
In Section 6 simulation examples are analyzed. In Section 7 a real data ex- 
ample of internet traffic is tested. Concluding remarks are given in Section 
8. Technically involved proofs are deferred to Section 9. 

2. The quotient. Suppose X and Y are identically distributed positive 
random variables satisfying P(X > y) > 0, V[X < y) > 0. Then the quo- 
tients between X and Y are Y/X and X/Y . Suppose {(Xj, Yi), i = 1, . . . , n} 
is a bivariate random sample of [X^Y). Then we will have 2n quotients. 

2.1. The quotient correlation coefficient. It is easy to see that both 
maxj<„{Xj/l^} — 1 and maxj<.„{l^/Xj} — 1 are asymptotically nonnegative. 
When these two values are both close to zero for a sufficiently large sample 
size n, one can say that the changing magnitudes of X and Y are very close 
to each other. The same is true for maxj<„{Xj/l^} maxj<„{l^/Xj} — 1 being 
close to zero. 
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We now use the above three quantities to define a new statistical coeffi- 
cient, that is, the quotient correlation coefficient: 

,„ 1 N _ iaa.Xi<n{Yi/Xi} + m&Xi<n{Xi/Yi} - 2 

maxi<n{Yi/Xi} xma.Xi<n{Xi/Yi} -1' 

Some functional properties of q„ are illustrated by looking at the following 

function: 

X -\- V — 2 

f{x,y) = T~ ioT x>l, y>l,x + y>2. 

xy-l 

The case of x + y = 2 is trivial since it implies Xi and Yi are equal. It is easy 

df _ jy-i? ^ n a/ 

dx — {xy-iy^ ^ ^' dy 



to see < fix, y) < 1, /(I, y) = 1, f{x, 1) = 1, and g = -^^ < 0, ^ 

-(fe^ < 0, £4 = -2^^ji^ < 0, which imply /(xi, yi) < /(x^, y^), 
where xi > X2, 2/i > 2/2- 

The properties of the function f{x,y) defined above suggest that, asymp- 
totically, q„ can take values ranging from to 1. The monotonicity of f{x, y) 
suggests that, for a fixed sample size n, the larger the maxj<„{Xj/l^} or 
the maxj<„{l^/Xj}, the smaller the q„, and hence the lower the agree- 
ment of changing magnitudes. The boundary properties of f{x,y), that is, 
f{x, 1) = 1, f{l,y) = 1, suggest that the value of q„ is largely determined by 
the smaller value of maxj<„{Xj/l^} and maxj<„{l^/Xj}. This property is 
very useful, especially when there exist outliers in one of the two sequences. 

The computed q^ value can suggest the dependence magnitude between X 
and Y. For example, when q„ is close to 1, there are co-movements between 
Xi and Yi, and hence, X and Y are nearly completely dependent. 

Notice that in the definition of q^ the values of Xi and Yi are required to 
be positive. In practice, this may not be the case. A variable transformation 
can resolve the issue. Sometimes a simple shift of the data to the right works 
too. For example, we can have Y/ = c — Yi, i = l, . . . ,n, being positive values, 
where c is chosen such that both maxj<„{y//Xj} and maxj<„{Xj/y/} are 
greater than 1. Then we can use Y- and Xi to define q^ as we did in (2.1). 
When qn is close to 1, the movement of Xi is opposite to that of Yi. 

We now give an example that shows even if the supports of X and Y 
are (0, oo), X/Y and Y/X are bounded, and hence, q„ is greater than 
regardless of sample size. 

Example 2.1. Suppose Zi, I = 1,. . . ,L, are independent unit Frechet 
random variables with distribution function F{x) = exp(— 1/x), x > 0. Let 
X = maxi<i<LaiiZi, F = maxi<Ki 0^2^/, where a/j > 0, E/«K = 1> i = 
1,2. Let ci = maxi<L{aii/ai2) and C2 = maxKi(a/2/an), then F{X/Y < 
ci) = P(max/ aiiZi < max; ciai2Zi) = 1, and F{Y/X < C2) = F{maxi ai2Zi < 
maxiC2aiiZi) = 1. Therefore, maxi<n{Yi/Xi} -^ a, maxi<n{Xi/Yi} -^ 
C2, and Qn -^ %tc2-i ' as n ^ 00. 
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This is a special case of multivariate maxima of moving maxima processes 
(or M4 processes) which have been relatively extensively studied in Smith 
and Weissman (1996), Zhang (2004, 2006), Zhang and Shinki (2006) and 
Zhang and Smith (2004a). This example will be used in Section 6. 

Remark 2.2 stresses the importance of having the data used to compute 
q„ in a unified scale. 

Remark 2.2. Suppose {{X* ,Y*,U*), i = 1,. . . ,n} is a trivariate ran- 
dom sample of {X* ,Y* , U*), where X* , Y*, U* are independent. Let Xi = 
U*X*, Yi = U*Y*, then Xi and Yi are dependent. However, X,/Yi = X*/Y* 
which suggests that the value of q^ computed based on Xi and Yi and the 
value of q^ computed based on X* and Yi* are the same. At first glance the 
quotient correlation seemed unusable. Notice that Xi and Xi do not have the 
same distribution. When both Xi and Xi are converted into unit Frechet 
scales, the quotients computed from the resulting variables will no longer 
give the same q^ value. In other words, the quotient correlation coefficient 
is applicable. 

It is natural to use unit Frechet scores to define the quotient measure since 
the limiting distributions of maximum of quotients are a unit Frechet distri- 
bution when each quotient is derived from two independent unit Frechet (or 
unit exponential) random variables. When marginals are not unit Frechet, 
marginal transformation is needed. Nonparametric methods of converting 
the data into unit Frechet scales will be discussed in Section 2.2. For para- 
metric method details, we refer to Smith (2003), Zhang (2005) and Zhang 
and Smith (2004b). 

The quotient correlation q„ measures the nonlinear dependence since the 
quotient itself is a nonlinear bivariate function. One can see that the value 
of qn mainly depends on the tail observations of two random variables X 
and Y. U X and Y are tail independent, the value of q^ turns out to be 
negligibly small. In this case, we propose to replace the maximal quotients 
by the kth largest quotients (or the pth percentiles) in their corresponding 
quotient series. In practice, k should be small (1, 2, 3, etc.) in order to make 
the calculated dependence measures meaningful. Similarly, the choice of p 
should be meaningful. These new statistics will be studied in a different 
project. We focus on (2.1) in this work. 

2.2. The rank based quotient correlation coefficient. Suppose now that 
Xin < -^^271 < • • • < Xnn Siie the ordered sample of Xi,X2,. ■ ■ , Xn ; Yin < 
Y2n < • • • < Ynn are the ordered sample of Yi, 12, • ■ • , ^n- Define the empirical 
distribution function of the X sample by Fx{x) = ^/nJ27=i^{Xi<x)- Then 
Fx{Xkn) = k/n. Similarly, define the empirical distribution function of the Y 
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sample by Fyiy) = ^/nY^^=iI{Yi<y)- Then FyiYkn) = k/n. Therefore, when 
we transform the data into the unit Frechet scale based on the empirical 
distribution functions and ranks, we get the same unit Prechet scaled values 
for two identical ranks in both X and Y samples. 

The transformation can be done with the function — l/[e + log{Fx(^fcn)}] = 
— l/[e + log(fe/n)],fc = 1,2, . . . ,n, where e is a small number to avoid division 
by 0. In practice, however, this small number is sensitive to the transformed 
values depending on the sample size n. Similar discussions hold for the Y 
sample too. Here, we introduce a different procedure to do data transforma- 
tion. 

Using the above arguments, we generate a unit Frechet sample of size n 
and denote Zn) < Z12) < ■ ■ ■ < Z(n) as the ordered sample. The rank based 
quotient correlation is defined as 

^fR_ maXi<n{Z(rank[yj)/^(rank[X^])} + 1^^'^i<n{Z(ra.nk[Xi])/ Z(rsink[Yi])} ~ 2 
maXj<„{Z(rank[yj)/-^(rank[X,])} X maXi<„{Z(i.ank[X,])/^(rank[yi])} " 1 

(2.2) 

One immediate advantage of this definition is that the transformation does 
not depend on any correction term. It is obvious that the definition is also 
based on a simulated random sample of a unit Prechet random variable. We 
have found this form is easy to implement and the simulation results are 
very close to qn, which is based on a parametric transformation. In practice, 
we suggest simulating 10 or more unit Frechet samples of size n, computing 
and reporting the mean or median value of q^. 

After establishing quotient correlation coefficients, in the next section we 
present one of the important application features of quotient correlations, 
that is, their uses in testing independence between random variables. 

3. The gamma test statistic for testing independence. 

3.1. The hypothesis and the test statistic. It is well known that any abso- 
lutely continuous random variable can be converted into a random variable 
with desired properties by marginal transformation. Without loss of gen- 
erality, suppose now {{Xi, Yi), z = 1, . . . ,n} is a bivariate random sample 
of {X,Y) with unit Frechet margins. Then we can formulate the following 
hypothesis test of independence: 

Hq : X and Y are independent versus H^ : X and Y are dependent. 
(3.1) 

Here the superscript c means that the complete data are used in (3.1). Later 
we shall give cases where only tail values are used. 

The following theorem shows the properties of a random sample under 
HS of (3.1). 



QUOTIENT CORRELATION AND GAMMA TEST 7 

Theorem 3.1. If X and Y are independent and have unit Frechet mar- 
gins, and {Xi, Yi), i = 1, . . . ,n, is a random sample from {X,Y), then 
lim„^ooP{ra~"^ X (maxj<„li/Xj + 1) < x} = e~^'^ , and random variables 
maxi<nYi/Xi and minj<„l^/Xj are asymptotically independent — that is, 

lim F\n~^(maxYi/Xi + l] <x, 

(3.2) ^ V - 

n-^(umxXi/Yi + l] < y| = g-^/^-^/^. 

Furthermore, as n — > oo, the random variable q„ is asymptotically gamma 
distributed — that is, 

(3.3) nqn — > C, 
where C, is a gamma(2, 1) random variable. 

A proof of Theorem 3.1 is deferred to Section 9. 

The hniiting distribution of nq„ leads to the following testing procedure 
(the gamma test): if nq„ > Cai where C,a is the upper ath percentile of the 
gamma(2, 1) distribution, Hq of (3.1) is rejected; otherwise, it is retained. 

As mentioned earlier, we can transform the underlying random variables 
to any other scales of interest. This gives the power from a probabilistic point 
of view one may assume without loss of generality that the marginal distri- 
butions are known. Under the null hypothesis, although we do not have any 
parameters that associate one population with another, we do have parame- 
ters in each individual population. Estimators for the marginal distribution 
may influence the asymptotic behavior of tests. In a statistical problem Xi 
could be a unit Prechet random variable resulting from theoretical variable 
transformation, while Xi[n) could be a unit Frechet scaled value resulting 
from a variable transformation using an estimated distribution function. In 
most applications Xi{n) are used in statistical inferences including testing 
tail independencies. 

In the following Theorem 3.2, we introduce a sufficient condition under 
which the marginal effects from the estimated parameter values do not have 
any effects on the asymptotic behavior of tests. 

Theorem 3.2. Suppose 9, 9i, i = 1,. . . ,n, are absolutely continuous ran- 
dom variables with distribution function Gg{x), and rj, r]i, i = 1, . . . ,n, are 
absolutely continuous random variables with distribution function G^(x). 
Denote any estimated distribution function for 6 by Ge{x), and any es- 
timated distribution function for rj by Gr,{x). Define Xi = F~^{Go{9i)), 
Yi = F~^{Gri{r]i)), i = 1, . . . ,n, where Fix) is the unit Frechet distribution 
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function. Suppose Ge{Oi) —^ Ge{Oi), Grj{r]i) ^-4 Grj{r]i), for all i, as n^ oo. 
Then random variables niaxj<„yi/Xj anrf maxj<„Xj/l^ are each asymptot- 
ically gamma distributed, and they are asymptotically independent — that is, 

lim pi n"^ f maxyj/Xj + 1 ) < x, n"M maxXj/yj + 1 I < y 

(3.4) ^ V - 

= ^-l/x-l/y_ 

Furthermore, as n — > oo , the random variable, 

maxi<n{Yi/Xi} + maxi<n{Xi/Yi} - 2 



(3.5) Qn 



maxi<n{yi/Xi} X ma.Xi<n{Xi/Yi} - 1 



r 

is asymptotically gamma distributed — that is, nq.„ — > C, where C is a gamma(2, 1) 
random variable. 

A proof of Theorem 3.2 is deferred to Section 9. 

An immediate practical application is to apply the empirical distribution 
function in Gg{-) and Gr^{-), and we have the following corollary. 

Corollary 3.3. Suppose Xi,...,X„ are observed variables with abso- 
lutely continuous distribution Fx{x), and Yi,...,Yn are observed variables 
with absolutely continuous distribution Fyiy). Define 

1 »i -in 

Px (x) = — — -^ Yl kXi<x) , Fy (y) = — —r-r H hYi<y) ■ 

Then, as n ^^ oo the empirical distribution transformation based quotient 
correlation 

q^ = (ma^{\ogiFx{X,))/log{FY{Yi))} 



(3.6) 



+ max{log(Fy (y,))/log(Fx(X,))} - 2 

«<n 

X fmax{log(Fx(X))/log(Fy(y))} 

X max{log{FY{Yi))/log{Fx{Xi))} - 1 

i<n 

is asymptotically gamma distributed — that is, nq,^ — > (^, where ^ is a gamma(2, 1) 
random, variable. 

Notice that q„ and q^ are asymptotically equal in distribution, and we 
also have nq^ — > (, as n — > oo. We apply q^ in this work. 
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3.2. Power comparisons between the gamma test and Fisher^s Z -transforma- 
tion. We now assume X and Y are normal random variables for the mo- 
ment. Suppose r^y is the sample correlation coefficient of a bivariate ran- 

dom sample of size n. Then Fisher's Z-transformation is W = 2 log{ -^_^^^ ), 
which satisfies E{W) - i log(i±g^), Var(W^) - ^, and Z = ^^ ^ 

A^(0, 1), which in turn allows us to test different hypotheses about the 
correlation — see details in papers by Fisher (1915, 1921), Hotelling (1953), 
Freeman and Modarres (2005) and a book by Hardle and Simar (2003). In 
our applications we test the null hypothesis of pxY = versus the alternative 
hypothesis of pxY 7^ 0. 

When observed variables are actually independent, we have found that 
both Fisher's Z-transformation test and the gamma test have low Type I 
error rate (less than 5%), which, suggests that both tests are practically 
efficient. We now show the efficiency of the gamma test when variables are 
nonlinear dependent. 

Example 3.1. In this example, the correlation coefficient between two 
random variables is zero, while the two random variables are actually de- 
pendent. For example, suppose X is standard normal, and let Y = X"^ . It is 
easy to show cov{X,Y) = 0. Like the previous example, we simulate 100 bi- 
variate random samples of size n, n = 25, . . . , 100. For each random sample, 
we conduct both Fisher's Z-transformation test and the gamma test with 
significance level a = 0.05. At each sample size n, we compute the empiri- 
cal power of each test. The performances are illustrated in the left panel of 
Figure 1. Clearly, one can see that when sample size is larger than 37, the 
gamma test is superior, but Fisher's Z-transformation test is not applicable 
in this instance. 

Needless to say, we are not proposing a test statistic to replace Fisher's 
Z-transformation test. In our point of view, these two tests can be used 
as two primary tests for testing independence since in most cases they will 
agree with each other and both tests are easy to implement. However, if the 
gamma test fails to suggest independence, while Fisher's Z-transformation 
test does suggest independence, the data may be actually drawn from two 
nonlinear ly dependent random variables. 

As a comparison, we performed the rank based quotient tests (replaced 
c[n by q„ ) to the example above. The results are plotted in the right panel 
in Figure 1. One can see that its performance is just as good as (or slightly 
better than) the quotient correlation. 

Notice that the power curve is not very smooth, and they may not be very 
accurate. For the purpose of demonstrating that the gamma test outperforms 
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Fig. 1. Comparison between Fisher's Z -transformation test and the gamma test. Left 
panel is based on q„ and Fisher's Z -transformation function. Right panel is rank based 
test. 

the Fisher's Z-transforniation test in cases when nonhnear dependencies oc- 
cur, we argue that Figure 1 may be sufficiently enough. To estimate the 
power curve much more accurate, one can use monotone smoothing which 
uses the strengths from neighboring points; see He and Shi (1998), for ex- 
ample, we will apply this method in a different study. 



4. The tail quotient correlation. In the literature the characterization 
of tail dependence between random variables is through a tail dependence 
index [Sibuya (1960) and Embrechts, McNeil and Straumann (2002)]. We 
present the definition next. 

Definition 4.1. Suppose random variables X and Y are identically 
distributed with Xp = sup{x G ]R:P(X < x) < 1}. The quantity (if exists): 
A = lim„^^.^ ^{X > u\Y > u) is called the bivariate tail dependence index, 
or the tail dependence index, between X and Y . It quantifies the amount of 
dependence of the bivariate upper tails of X and y. If A > 0, then X and Y 
are tail dependent, otherwise the two random variables are tail independent. 

This definition has been extended to cross-sectional tail dependencies in 
multivariate time series; see Zhang and Huang (2006) for example. Heffer- 
nan, Tawn and Zhang (2007) showed that X and Y are not necessarily iden- 
tically distributed when one studies tail dependencies. However, we restrict 
our study to identically distributed random variables in this work. 

To calculate A, the joint distribution of X and Y must be known. In 
practice, when data is available, the data can be used to estimate the tail 
dependence index based on the empirical distribution functions and a very 
large threshold value u. We seek a new alternative quotient based statistical 
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coefficient that measures the magnitude of tail dependence between two 
variables. 

In general, when we study tail values, on one hand, we should use as 
much tail information as possible; on the other hand, we should use as little 
data from the center and below the threshold as possible. It is natural to 
use values above a threshold. However, this idea raises questions: how do we 
deal with the values below the threshold, and how do we get the quotient of 
the co-responses. Our choice is to use the threshold value for all those values 
below the threshold. 

Using the arguments above, we define the tail quotient correlation coeffi- 
cient as 

^ maxi<n{{u + Wi)/{u + Vj)} + max^<n{(u + Vi)/{u + Wj)} - 2 

"^"'^ " maxi<„{(n + Wi)/{u + Vi)} x maxi<„{(u + Vi)/{u + W,)} - 1 ' 
(4.1) 

where u is a positive threshold, and Wi and Vi are exceedance values over 
the threshold u of positive random variables Xi and Yi respectively. Here 
we assume Xi and Yi are identically distributed. 

Similarly, we can define a rank based tail quotient correlation coefficient. 
With the established notation in (2.2), a rank based tail quotient correlation 
is defined by 



,7^ 



/ r nl(pi/i=0) + -^(rank[Vl/,])l{Wi>0) 

I 1T13jX \ — 

\i<n [ ul(_Vi=0) + Z(ra.nk[Vi\)'^{Vi>0) 



+ max 

i 



(4.2) 



X max 



f ^l(y,=0) +-^(rank[y.])l(V,>0) 
<n \ul(^Wi=0) + -^(rank[H/i])l(Wi>0) 



\i<n I nl(y^=o) +^{rank[y,])l(y,>0) 

i<n \Ml(ty,=o) + -^(rank[Wi]) l(iyi>0) J 

where u is a global threshold value from a unit Frechet distribution function 
and !(.■) is an indicator function. We will use this rank based tail quotient 
correlation to conduct the tail independence test in Section 7. 

One of the important uses of the tail quotient correlation is in the testing 
of tail independence which we study in the next section. Simulation examples 
are illustrated in Section 6. 

5. The gamma test statistics for testing tail independence. The follow- 
ing theorem gives a necessary condition that two random variables are tail 
independent. 
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Theorem 5.1 (Necessary condition). In Definition 4.1 suppose that X 
and Y are unit Frechet random variables, and A = 0, then q„ —^ 0, q^^ —^ 
as n^t CO. 

A proof of Theorem 5.1 is deferred to Section 9. 

It is clear from Theorem 5.1 that a large tail quotient correlation would 
suggest tail dependence, and a zero value of tail quotient correlation would 
suggest that tail dependence is very weak. In most cases, however, the com- 
puted tail quotient correlation coefficient may never be zero. A statistical 
inference problem then arises: How small does a tail quotient correlation co- 
efficient need to be for us to conclude that the two random variables are tail 
independent? This turns out to be a statistical hypothesis inference problem: 

(5.1) Hq : {X, Y) is tail independent <^ Hf : {X, Y) is tail dependent. 

Here the superscript t indicates that the test is for tail (in)dependence only. 

Notice that (2.1) and (3.3) together test the null hypothesis of indepen- 
dence, not a null hypothesis of tail independence. We argue that the tail 
quotient correlation leads to a true tail independent test statistic under Hq 
next. 

Let 

/C r,\ f Xl, X2, ..., X„ 

be an independent array of unit Frechet random variables. Now let {Ui,Qi), 
i = 1, . . . ,n be a bivariate random sequence, where both Ui and Qi are 
correlated and have support over {0,u] for a high threshold value u. Let 
Xui = XiI)^Xi>u} + Uil{x^<u}, Yui = YiI)^Y,>u} + QiI{Y,<u}, i = l,...,n. Then 

Xul \ I Xu2 \ I Xun 



is a bivariate random sequence drawn from two dependent random variables 
Xui and Yui- Notice that Xuil{x^,>u} (= Xili^x,>u}) and y„i/{y^^>„} (= 
YiI{Yi>u}) are independent, but Xuil{x^^<u} {=UiI{x,<u}) and YuiI{Y^^<u} 
(= QiI{Y^<u}) are dependent. In fact, one can easily construct arbitrarily 
dependent structure for values below the threshold u. Consequently, if only 
tail values are concerned, we can assume the tail values are drawn from 
(5.2) under the null hypothesis of tail independence. We have the following 
theorem. 

Theorem 5.2. Suppose Vi and Wi are exceedance values (above u) in 
(5.3), and Ui and Qi have the distribution e^''"e~^'^,0 <x <u. Then 

t t 



-(i+t)/u^ z/0<t<l, 

(5.4) p(:^^:i<t) = <; 1 + ^ i + ^ 



!i±i^^.UJ 1 + t i + t 
i + t~^ i + t'^ 
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(5.5) lim P<jn"^ 
Moreover, 

(5.6) lim P< n 



max{u + Wi)/{u + Vi) + I 

i<n 



< x> = e 



mm {u + Wi)/{u + Vi) 



i<n 



< a; > = 1 — e 



-(l-e-i/")/x 



^(l-e-i/")a 



The random variables maxj<„ {u + Wi)/{u + Vi) and niaxj<„ (u + Vi)/{u + Wi] 
are tail independent — that is, 



lim P< n 



(5.7) 



max TTT- + i 



i<n (u + Vi) 
(l_e-i/ii)/a;_(l„e-i/")/y 



< X, n 



max + 1 

,i<n (u + Wi) 



< 



Furthermore, as n — > oo, the random variable qu,n is asymptotically gamma 
distributed, that is, 

(5.8) nqu,n — > C, 

where C is gamma(2, 1 — e~^'") distributed. 

A proof of Theorem 5.2 is deferred to Section 9. Here are some intu- 
itive interpretations. If max{(u + Wi)/{u + Vi)} or max{(ti + Vi)/{u + Wi)} 
is close to 1, qun will have a larger value, which in turn implies that nq„„ has 
a larger probability to reject Hq or a small p- value will be obtained. A larger 
qun value implies that there exists tail dependence (extreme co-movements) 
between Xi and Yi. 

Equations (4.1) and (5.8) together establish a new test statistic — we call 
it the gamma test for the null hypothesis of tail independence — which can 
be used to determine whether there is tail dependence between two random 
variables. When nqun > Ca, where Ca is the upper ath percentile of the 
gamma(2, 1 — e"^'") distribution, Hq of (5.1) is rejected. 

Considering the threshold value u as a parameter, we obtain a new family 
(4.1) and (5.8) of gamma test statistics. When u tends to zero, we obtain 
the special case of (2.1) and (3.3). 

Notice that as u is increasing, the value of nqu^n is increasing since both 
m.ax{{u + Wi) / {u + Vi)} and m.ax{{u + Vt) / {u + Wi)} are decreasing. Intu- 
itively, a larger value of nqu^n renders to a rejection of the null hypothesis 
more likely. However, as u is increasing, 1 — e"^'"" is decreasing, and a de- 
creasing scale parameter 1 — e~^'" leads to a larger cutoff value Ca- Therefore, 
a truncation (below the threshold value u) of data does not make a rejection 
of Hq of (5.1) more likely. In fact, the gamma test can effectively detect all 
tail independent random variables in our simulation examples. 
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Remark 5.1. The tail quotient correlation coefficient qun can be used 
as a tail dependence measure once the null hypothesis of tail independence 
is rejected. In the literature several tail dependence measures have been pro- 
posed, such as those by Coles, Heffernan and Tawn (1999), Poon, Rockinger 
and Tawn (2001), Campos et al. (2003) and others. These measures and 
others have broad applications as long as tail events are concerned. 

Also, notice that (5.8) holds when exceedance values of Vi and Wi are 
replaced by exceedance values of Vi{n) and Wi{n) resulting from Xi{n) and 
Yi{n) respectively in Theorem 3.2. 

The following theorem tells that one can apply the gamma tests when data 
are drawn from strict bivariate stationary processes which satisfy conditions 
specified in the theorem. 

Theorem 5.3. Suppose {{X!^ ,Y*), i = 1, . . . ,n} is a strict bivariate 
stationary processes with marginal distribution Fx*{-) and -Fy*(-) which 
are absolutely continuous. F{-) is a unit Frechet distribution function. Let 
Xi = F-^{Fx*{X*)), Yi = F-^{FY-'{Y*)),i = l,...,n. Suppose the random 
sequences {^i = Xi/Yi}, {^[ = Yi/Xi} satisfy Leadbetter^s (1974) F){un) and 
D'[un) conditions for Un = nx — 1 , and Davis^ (1982) C\ and C2 conditions. 
Suppose Fx*{-) and Fy*{-) are estimated distribution functions for X^ and 
Y* respectively, andasn^oo, Fx*{X*) -^ Fx*{X*), FyiY*) -^ Fy*{Y*), 
for all i = 1,2, ... , almost surely. Define Xi = F~^{Fx*{X*)) and % = 

F^^{Fy*{Y^)). Then as n^oo, both (3.3) and (5.8) hold, and nq„ — >(,, 
where q„ is defined as the one in (3.5). 

A proof of Theorem 5.3 is deferred to Section 9. 

6. Simulation examples. We have done substantial computation to check 
the efficiency of the gamma test. We have found that the gamma test has 
effectively detected tail (in)dependent random variables in all examples (in- 
cluding the sum and the maximum of a sequence of random variables, and 
variables in M-dependent sequences). Some typical examples are reported 
in the next section. 

6.1. Bivariate random samples. In this section we simulate bivariate ran- 
dom samples of sample size 500 from a bivariate random variable {X,Y). 
The value of p (correlation coefficient) is used to measure the correlation 
(linear dependence) between the two random variables. The value of 9 in 
Gumbel type copulas is determined hy 9 = -y/1 — p. 

Notice that the correlation coefficient between two random variables may 
not exist. If the correlation coefficient does not exist, p does not have a 
specific meaning. 

These simulated bivariate sequences are the following: 
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Table 1 
This table reports p-values obtained when the gamma test was applied to data drawn from 
models (a), (b), (c), (d), (e), (f), (g) and (h). The first column shows how the threshold 

values u were chosen 



Percentile 


(a) 


(b) 


(c) 


(d) 


(e) 


(f) 


(g) 


(h) 


0.8000 


0.2554 


0.0000 


0.0538 


0.0000 


0.6889 


0.0538 


0.0904 


0.0000 


0.8250 


0.2448 


0.0000 


0.0512 


0.0000 


0.6860 


0.0825 


0.0906 


0.0000 


0.8500 


0.2380 


0.0000 


0.0492 


0.0000 


0.6822 


0.1226 


0.0869 


0.0000 


0.8750 


0.2340 


0.0000 


0.0473 


0.0000 


0.6769 


0.1762 


0.0834 


0.0000 


0.9000 


0.2307 


0.0000 


0.0549 


0.0000 


0.6738 


0.2228 


0.1105 


0.0000 


0.9250 


0.2277 


0.0000 


0.0745 


0.0000 


0.6668 


0.2621 


0.1962 


0.0000 


0.9500 


0.2269 


0.0001 


0.0860 


0.0000 


0.6605 


0.2624 


0.1940 


0.0000 


0.9750 


0.2308 


0.0009 


0.1010 


0.0002 


0.6596 


0.2878 


0.1975 


0.0000 



(a) Bivariate random samples drawn from two independent unit Frechet 
random variables. 

(b) Bivariate random samples drawn from the Gumbel copula, 

Ce(i7(ni),if(n2)) = e-(S-i["i°s^^("')>l'^')', ui > 0, us > 0, 

where H{u) is a unit Frechet distribution function. Here 9 = 0.4472. Notice 
that the Gumbel copula is the only Archimedean copula which is also an 
extreme value copula [page 87, Mari and Kotz (2001)]. 

(c) Bivariate random samples drawn from the survival Gumbel copula, 

P(Xn > ni,X2i > u^) = e"(Elj-MP{-'^->«a)]^/^)^ 

ui>0, U2>0, 6* = 0.4472. 

(d) Bivariate random samples drawn from Example 2.1, where 1 < Z < 30 
and the coefficients are simulated values. 

(e) Bivariate random samples drawn from {l/U, 1/(1 — [/)), where U is 
uniform (0, 1). (Sid Resnick contributed this example.) 

(f) Bivariate normal random samples with correlation p = 0.8. 

(g) Bivariate random samples drawn from X = ZiEi, Y = Z2E1, where 
El is exponential 1. Zi and Z2 are unit Frechet. 

(h) Bivariate random samples drawn from two t(4) (Student t with 4 
degrees of freedom) random variables with correlation p = 0.8. 

For these eight examples, (a), (c), (e), (f) and (g) are tail independent 
examples, while (b), (d) and (h) are tail dependent examples. When we con- 
duct the gamma test, the null hypotheses would be expected to be retained 
when data are drawn from any one of (a), (c), (e), (f) and (g), that is, larger 
p-values would be obtained; the null hypotheses would be expected to be 
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rejected when data are drawn from any one of (b), (d) and (h), that is, 
smaller p- values would be obtained. 

In our simulation study we do not use a fixed u value. Instead, the thresh- 
old values are automatically chosen at the lOOpth percentiles of the sam- 
ple data, where p = 0.80, 0.825, 0.875, 0.90, 0.925, 0.95, 0.975, respectively We 
take the smallest value of the two lOOpth percentiles from two different 
sample sequences. Some typical test results from a single gamma test are 
summarized in Table 1. One can immediately see that the gamma test does 
work. 

We have repeated running the program 100 times and recorded the to- 
tal number of rejections of the null hypothesis of tail independence by the 
gamma test. The power of the gamma test among all simulated examples 
is 88%, which is very high. It would be fair enough to say that the gamma 
test is very efficient for testing tail independence. 

7. Application to internet traffic. In network design, especially in in- 
ternet traffic control, where the tail independence or dependence between 
two dependent variables can help networking researchers identify the main 
cause of internet delay, and design a better network structure, there have 
been a series of studies on tail dependence between internet traffic vari- 
ables, for instance by Campos et al. (2003), Maulik, Resnick and Rootzen 
(2002), Resnick (2001, 2002), Zhang, Breslau, Paxson and Shenker (2002) 
and many others. These studies have shown that some variables are tail 
dependent, while some variables are tail independent. 

Our application study tests tail (in)dependence of the joint behavior of 
large values of three internet traffic variables: size of response, time duration 
of response, and throughput (rate = size/time), which are in the context of 
HTTP (web browsing) responses. The data set was gathered from the Uni- 
versity of North Carolina main link during April of 2001. The data set used 
in this analysis was for "morning" block (8:00AM-12:01PM). We obtained 
76325 subsets of which there are 500 data points in each subset. 

We performed the gamma test (using the rank based tail quotient cor- 
relation coefficient) on each subset. We found tail dependence between size 
of response and throughput in more than 3 percent of the data subsets. 
However, we did not find any statistical evidence that there is tail depen- 
dence between time duration and throughput. In Figure 2 we plot the tail 
behaviors between the traffic variables. The pattern in the right panel is 
discovered by the tail quotient test statistic. The data which generate the 
left panel is tested as tail independent. The data which generates the right 
panel is tested as tail dependent. As far as we know, such findings have not 
been reported in the literature. 

These findings certainly encourage network researchers and network de- 
signers to better take into account the variable tail dependence in the net- 
work design and network traffic study. This is just a simple application of the 
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The rank scalier plot (a) between response size aiKJ ihrougliput 



The rank scatter piol (b) between response size and ihroughpm 
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Fig. 2. Comparison between two different data sets collected at different time periods. 
The pattern in the right panel is discovered by the tail quotient test statistic. The data 
which generates the left panel is tested as tail independent. The data which generates the 
right panel is tested as tail dependent. 



gamma test in an internet traffic study. A detailed analysis of these traffic 
variables can be done, which will be a different research project. However, 
we restrict ourself to a traffic variable tail dependence test in this paper. 

8. Concluding remarks. Resnick (1997) argues "Why non-linearities can 
ruin the heavy tailed modeler's day." What we hope to provide, in this 
paper, is an enabling methodology that will allow model builders to study 
tail behaviors of underlying multivariate time series on a scale previously 
not very attainable. 

The new family of gamma test statistics can effectively lead to the right 
indications of tail (in)dependencies at high threshold levels, when the total 
sample size is about 500, as shown in our simulated examples. The gamma 
test has also given evidence of the tail independence of bivariate normal 
random variables, and of the tail independence of the sum and the maximum. 
The gamma test has outperformed Fisher's Z-transformation test when data 
are nonlinearly dependent and the sample size ranges from 37 to 100. 

The new concept of the gamma test for (tail) independence should en- 
hance the statistical theory of hypothesis testing. At the least, it adds one 
more member to the previously existing family of test statistics. 

The tests and models of tail dependencies are still far from perfect. There 
is much work yet to be done. We believe that the methodology developed in 
this paper will be very useful in the analysis of financial data. Such research 
could produce more effective financial time series model and a clearer view 
of what has been missing in option pricing models. 

In the literature, for exceedance problems, there have been existing test 
statistics derived based on empirical likelihood methods or empirical pro- 
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cesses. Among those test statistics, the null hypothesis is tail dependen- 
cies which involve pre-specified model assumptions, and hence, they can be 
thought as testing a class of models. In our setup, the null hypothesis is tail 
independence. As a result, it is not practically useful to compare our test 
with existing tests, and it is not easy to carry out. 

When the null hypothesis of (tail) independence is rejected, q^ (or qu„) 
can be used as a (tail) dependence measure or as an estimate of a tail de- 
pendence index. The properties of this new measure may need to be studied 
further. A generalization of qun based on max-stable processes may be worth 
investigation. 

The gamma tests and the (tail) dependence measures can be applied to 
many areas, for instance, financial study, internet traffic analysis (as men- 
tioned above), environmental science, geophysics, microarray data analysis 
and psychological behavior study are just a few out of many areas. 

9. Technical arguments. 

Proof of Theorem 3.1. First we have F{Yi/Xi <t} = t/{l + 1), which 
implies 

lim p|n"^ (maxYi/Xi + l] < x\ = e~^/^. 

To prove (3.2), we compute the following probabilities: 

F\n'^(maxYi/Xi + 1 ) < x, n~^[maxXi/Yi + 1 ) < y\ 

L V *<" / V i<'^ / J 

F{X,/{ny - 1) < y, < (nx ■ 

/•oo /•(nx—l)z 1 '^ r 1 1 

Jo Jz/{ny~i) J L nx ny, 

which implies (3.2). For (2.1), we have 

ma-K{Yi/Xi} + ma-K{Xi/Yi} - 2 



-1)^ 


J 


n 


1- 



nq„ 



n- 



iuax{Y,/X.i}uiax{Xi/Yi} - 1 
Yi 



n 



X n 



+ 1 j + n ^{ max<^ —^ 



+ 1 
+ 1 



I max 
V VX, 

|} + l)n-(max{| 

[n-^{max{Yi/Xi} + 1) + n-^(max{X,/yj + 1)][1 + Op(l) 
[n-i(max{y,/Xi} + l)n-\max{Xi/Yi} + 1)][1 + Op{l)] 



n 



n 



+ 1 +n- 



I max 



{f} 



+ 1 



-1 
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which gives (2.1) by the Cramer- Wold device and Slutsky's theorem. D 

Lemma 9.1. Suppose X, Xi,X2,. ■., are positive random variables. Then 
Xn —^ X if and only if there are two sequences of random variables (,i{n), 
^2i'n) such that ^i{n) < Xn/X < ^2('^); n = l,2, . . . , and^i{n) —^ 1, ^2('t-) ——^ 
1, as n^> oo. 

Proof. The sufficient condition is obvious. The necessary condition can 
be shown by simply defining ^i(n) = Xn/X * (1 — 1/n), and ^2(^) = Xn/X * 
(1 + 1/n). D 



Proof of Theorem 3.2. Denote Xi = F~^{Ge{ei)), Yi = F~'^{G.n{vi)), 
then Xi —^ Xi, Yi —^ Yi, as n — > oo and, hence, by Lemma 9.1, there exist 
Cj{n) > 0, ^j{n) ^-4 1, J = 1,2,3,4, as n — > oo, and 



X Y 



which implies 

P< max-———— <nx — l,max-— — — — <ny — 1> 

[i<n Xi^i[n) i<n Yi ^s[n) J 

< P < max -J- < nx — 1, max -^ < ny — 1> 

~ I i<n Xi ~ i<n Yi ~ J 

< P< max-———— <nx— l,max--— <ny — 1>. 

I i<n Xi ^2{n) i<n Yi t,i{n) ) 

Since ij{n) / ^kin) converges to 1 almost surely for all j and k, and maxj<„ ^ 

and maxj<„ i^ are asymptotically independent, so by Slutsky's theorems, 
we have both the first probability and the last probability in the above 
inequalities converging to e~^'^~^'^ as n— > oo, and hence, (3.4) is true. The 
proof of the asymptotic distribution of nq„ is similar to the proof of the 
asymptotic distribution nq„ in Theorem 3.1. D 



Proof of Theorem 5.1. We prove q„ —^ 0, as n ^ oo only. The proof 
of q^n —-^ is similar. 

Suppose without loss of generality that there is a finite number z, 1 < z < 
oo, such that Y'{X/Y <z) = l. We have 

P(X > n, y > n) _ V{zY >X>u,zY> zu) 
¥{Y>u) ~ P(X > u) 

_ V{zY > X > zu, zY > zu) 
~ P{X > u) 
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P{zY > X > u, zY > zu,u < X < zu) 



+ 



P(X > u) 



^ P(X >zu) 1 ^ 
P(a > u) z 

which contradicts with the zero hniit on the left-hand side. So both ratios 
X/Y and Y/X have support over (0, oo). So maxj<„, -^ -^ oo, maxi<„ ^ -^ 

oo, as n ^ oo, and hence, q„ — ^ 0. D 

Proof of Theorem 5.2. We first prove (5.4). For t < 1, we have 

lu + Vi J \u + {Z2-u)Iz^>u 

= P| ^ < t, Zi > u, ^2 > u| + P I ^ < t, Zi < n, Z2 > M 

+ P|— <t,^i >n,Z2<nl+P{l<t, Zi<'u, Z2<'u} 

= Ai + A2 + A3 + A4, 
where A3 = 0, A4 = and 
Ai = F{u <Zi< tZ2} 

00 /-oo 

u Jzi/t 

roo 1 

Ju Z-^ 

/■oo 1 1 

A ^? 1 + t^ ^ 

A2 = P{Zi < n, Z2 > n/t} = e-^/"(l - e"*/") = e"^/" - e-(^+*)/". 

Adding Ai and A2, we get (5.4) for < t < 1. 
For t > 1 , we have 



A1 = P(^< Y<^2,^2>n 



tu /"CXD /"CO /"OO 



1+t 1+t 
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A2 = F{Zi < u, Z2 > u} = e"^/"(l - e-i/") = e"^/" - e^^/", 

A3 = F{u <Zi< tu, Z2<u} = e-i/«(e-i/M _ g-V") = g-(i+i)/M _ ^~2/u^ 

A4 = F{Zi <u, Z2<u} = e-2/«. 

Adding the above terms together, we thus obtain (5.4). 
We now prove (5.5). Since 

Fi -— <nx -1, — — <ny -1 

[u + Vi u + Wi 

(u + Wi u + Vi 

= F< — — <nx- 1, — — — <ny-l,Xi>u,Yi>u 

[ u + Vi u + Wi 

+ P\ — -— <nx-l, <ny -l,Xi> u,Yi < u 

^{u + Wi u + Vi 

+ P <^ -f^ <nx-l, <ny-l,Xi<u,Yi>u 

{u + Wi u + Vi 

+ P<^ ——<nx-l, <ny-l,Xi<u,Yi<u 

lu + Vi u + Wi 

where 



(Yi Xi 



Ai = P<^ -— < nx - 1, -— < ny - 1, Xi > n, Yi > fi 

00 i'{nx—l)x\ 
u Jnia.x.{xi/(ny~l),u} 



Any~l)u /•(nx—l)xi roo p{nx—l)xi 

Ju Ju J (nv—l)u Jxi /(nv—1) 



^ nx-l _ nx- l ^-(nx)/(nx-l)u , g-2/n I ^^^ ~ '^ ^~(ny)/(ny-l)u 

nx nx ny ny 

A2=P\^r- <nx-l,— <ny-l,Xi>u,Yi<u 
[Xi u 

= F{u <Xi< {ny - l)u, Yi<u} = e-^yl^^y-^> _ g-^/", 

A3 = p(^<nx-1, ^<ny-l, Xi < u,yi > nl = e-"^/("^-i)" - e'^/", 
Vu Yx J 

and A4 = e~^'". Therefore, 

P< -— <nx — \, — — <ny — 1 
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A1 + A2 + A3 + A4 
1 
nx ny nx 



1 



_ _|_ _^-nx/(nx-l)u _|_ J_^ 

ny 



-ny/(ny-l)u 



Since 



Fin 



max - — — — - + 1 

i<n [u + Vij 



< X, n 



jn + Vi) , ; 

max - — — — - + 1 

i<n (u + Wi) 



<y 



: P"<! ^ <nx-l, -^ <ny-l 



u + Vi 



u + Wi 



I I ^-nx/{nx-l)u _| ^-ny/{ny-l)u 

nx ny nx 

,-(l-e-i/")/a:-(l-e-i''")/j/ 



ny 

as n ^ 00, 



which gives (5.5)-(5.7). 

For (4.1) and (5.8), letting T^ = {u + Wi)/{u + Vi), Qi = {u + Vi)/{u + Wi), 
we have 



nq^ 



: n 



max{{u + Wi)/{u + Vj}} + max{(u + Vi)/{u + Wj)} - 2 
max{(M + Wi)/iu + Vi)} max{(u + V^)/{u + Wi)} - 1 

= (n~^(max{Ti} + 1) + n~^(max{ei} + 1) - 4/n) 

X (n-^(max{Ti} + l)n-^(max{ei} + 1) 

- n-i[n~^(max{Ti} + 1) + n-^(max{ei} + 1)])"^ 

_ [n-^maxjTJ + I) + n-\max{ei} + !)][! +Op(l)] 
[n-i(max{Ti} + l)n"i(max{ei} + l)][l+Op(l)] ' 

which gives (4.1) by the Cramer- Wold device and Slutsky's theorem. So the 
proof of the theorem is complete. D 

Proof of Theorem 5.3. Using Theorem 3.5.2 of Leadbetter, Lindgren 
and Rootzen (1983) and Proposition 3.1 of Davis (1982), we immediately 
get Mn = max(^i, ^2, • • • , ^n) and Wn = min(^i, ^2, ■■■,^n) are asymptotically 
independent. Then the proof of (3.3) follows from Theorem 3.1, and the proof 
of (5.8) follows from Theorem 5.2. The last part follows from Theorem 3.2. 
D 
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